Evolution Strategies for Direct Policy Search

Authors

  • Verena Heidrich-Meisner
  • Christian Igel
Abstract

The covariance matrix adaptation evolution strategy (CMA-ES) is suggested for solving problems described by Markov decision processes. The algorithm is compared with a state-of-the-art policy gradient method and stochastic search on the double cart-pole balancing task using linear policies. The CMA-ES proves to be much more robust than the gradient-based approach in this scenario.
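The general idea of direct policy search with an evolution strategy can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy 1-D point-mass task stands in for the double cart-pole, and the search loop is a plain (mu/mu, lambda) evolution strategy with isotropic Gaussian mutations and a fixed step-size decay. It is not CMA-ES, which would additionally adapt a full covariance matrix and the step size. The names `episode_return` and `evolution_strategy` are hypothetical.

```python
import random


def episode_return(weights, steps=50):
    """Return of a linear policy on a toy 1-D point-mass task.

    Hypothetical stand-in task (not the double cart-pole): the state is
    (position, velocity) and the reward is the negative squared position.
    """
    pos, vel = 1.0, 0.0
    total = 0.0
    for _ in range(steps):
        # Linear policy: the action is a weighted sum of the state variables.
        action = weights[0] * pos + weights[1] * vel
        action = max(-1.0, min(1.0, action))  # bounded actuator
        vel += 0.1 * action
        pos += 0.1 * vel
        total -= pos * pos
    return total


def evolution_strategy(n_params=2, mu=5, lam=20, sigma=0.5,
                       generations=40, seed=0):
    """Simplified (mu/mu, lambda)-ES over policy weights (not CMA-ES)."""
    rng = random.Random(seed)
    mean = [0.0] * n_params
    best = (episode_return(mean), list(mean))
    for _ in range(generations):
        # Sample lam candidate weight vectors around the current mean.
        offspring = []
        for _ in range(lam):
            cand = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
            offspring.append((episode_return(cand), cand))
        # Keep the mu candidates with the highest episode return.
        offspring.sort(key=lambda t: t[0], reverse=True)
        elite = offspring[:mu]
        # Recombine: the new mean is the average of the selected candidates.
        mean = [sum(c[i] for _, c in elite) / mu for i in range(n_params)]
        if elite[0][0] > best[0]:
            best = elite[0]
        sigma *= 0.95  # fixed decay (assumption); CMA-ES adapts this instead
    return best


best_return, best_weights = evolution_strategy()
print(best_return, best_weights)
```

The search only ever uses whole-episode returns, so it needs no gradient of the policy or the dynamics. Only the ranking of candidates affects selection, which is one commonly cited reason for the robustness of evolution strategies.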


Similar resources

Effective Strategies for Optimal Implementation of Evolution and Innovation Packages in Medical Education

ABSTRACT BACKGROUND AND OBJECTIVE: Evolution and innovation packages in medical science education are the main program of medical education and it is necessary to pay attention to the provision of infrastructure of their implementation. This study was conducted to identify effective strategies for optimal implementation of evolution and innovation packages in medical education. METHODS: The met...


Neuro-Evolution for Multi-Agent Policy Transfer in RoboCup Keep-Away

An objective of transfer learning is to improve and speedup learning on target tasks after training on a different, but related source tasks. This research is a study of comparative Neuro-Evolution (NE) methods for transferring evolved multi-agent policies (behaviors) between multi-agent tasks of varying complexity. The efficacy of five variants of two NE methods are compared for multi-agent po...


Neuro-Evolution for Multi-Agent Policy Transfer in RoboCup Keep-Away: (Extended Abstract)

An objective of transfer learning is to improve and speedup learning on target tasks after training on a different, but related source tasks. This research is a study of comparative Neuro-Evolution (NE) methods for transferring evolved multi-agent policies (behaviors) between multi-agent tasks of varying complexity. The efficacy of five variants of two NE methods are compared for multi-agent po...


Digital Direct-to-Consumer Advertising: A Perfect Storm of Rapid Evolution and Stagnant Regulation; Comment on “Trouble Spots in Online Direct-to-Consumer Prescription Drug Promotion: A Content Analysis of FDA Warning Letters”

The adoption and use of digital forms of direct-to-consumer advertising (also known as “eDTCA”) is on the rise. At the same time, the universe of eDTCA is expanding, as technology on Internet-based platforms continues to evolve, from static websites, to social media, and nearly ubiquitous use of mobile devices. However, little is known about how this unique form of pharmaceutical marketing impa...


Structural-interpretative modeling of strategies for achieving the mission of education in an entrepreneurial and community-oriented university

Considering the importance of transitioning economies from a resource-based economy to a knowledge-based one, and the importance of the role of universities in the development of innovative and entrepreneurial activities, the subject of entrepreneurial university and the strategies necessary for its formation have attracted the attention of many researchers and policy makers in the field of hig...




Publication date: 2008